2-FOLD AND 3-FOLD MIXING: WHY 3-DOT-TYPE COUNTEREXAMPLES 
ARE IMPOSSIBLE IN ONE DIMENSION 



THIERRY DE LA RUE 



Abstract. V.A. Rohlin asked in 1949 whether 2-fold mixing implies 3-fold mixing for a sta- 
tionary process (£i)igg, and the question remains open today. In 1978, F. Ledrappier exhibited 
a counterexample to the 2-fold mixing implies 3-fold mixing problem, the so-called 3-dot system, 
but in the context of stationary random fields indexed by 2? 2 . 

In this work, we first present an attempt to adapt Lcdrappier's construction to the one- 
dimensional case, which finally leads to a stationary process which is 2-fold but not 3-fold 
mixing conditionally to the a-algebra generated by some factor process. Then, using arguments 
coming from the theory of joinings, we will give some strong obstacles proving that Lcdrappier's 
counterexample can not be fully adapted to one-dimensional stationary processes. 



1. Introduction: Rohlin's multifold mixing problem and Ledrappier's 
two-dimensional counterexample 

The following work is based on two recent results concerning Rohlin's multifold mixing problem 
which are contained in ^7] and [T^]. It seemed to me interesting to put these results together 
and show them in a different light, emphasizing mainly on the underlying ideas rather than on 
technical details. 

The object of our study is a stochastic process, that is to say a family £ = (£j)iez of random 
variables indexed by the set of integers, and we will always assume that these random variables 
take their values in a finite alphabet A. If two integers i < j are given, we will denote by ^ the 
finite sequence ...,£,-). Obvious generalization of this notation to the case where i = — oo 

or j = +oo will also be used. 

We are more particularly interested in the case where the stochastic process is stationary, which 
means that the probability of observing a given cylindrical event E (i.e. an event depending only 
on finitely many coordinates) at any position i £ 2 does not depend on i: 

(1) W > 0, V£ c A e+1 , Vi e z, p(ej +f G e) = p(^ e e) . 

Another way to characterize the stationarity of the process is to say that its distribution is invariant 
by the coordinate shift: Let T : A % — > A % be the transformation defined by T(£) = £, where for 
all i G TL, £j := Then the stochastic process £ is stationary if and only if the distribution of 

T(£) is the same as the distribution of £. 

The stochastic process £ is said to be mixing if, considering two windows of arbitrarily large 
size £, what happens in one window is asymptotically independent of what happens in the second 
window when the distance between them tends to infinity: 

(2) W > 0, VE U E 2 c A l+ \ g E u $ +e G E 2 ) - G £?i)p(^ G E 2 ) —— * 0. 

1.1. Rohlin's question. In 1949, V.A. Rohlin [Tl| proposed a strengthening of the previous 
definition involving more than two windows: £ is said to be 3- fold mixing if 

(3) W >0, VE l7 E 2 ,E 3 c A e+ \ 

p($ e E 1 ,e P +e e E 2 ,e P t q q +e e m) p($s e ^)f(^ g E 2 y[e P t q q +e e e 3 ) ~~ > o. 
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A straightforward generalization to k windows naturally gives rise to the property of being k-fold 
mixing. To avoid any confusion, we will henceforth call the classical mixing property defined 
by (0): 2- fold mixing 1 . 

Rohlin asked whether any stationary process which is 2-fold mixing is also 3-fold mixing. This 
question is still open today, but a large number of mathematical works have been devoted to the 
subject. Many of these works show that 2-fold mixing implies 3-fold mixing for special classes of 
stationary processes (see e.g. ^3] an d [22 f° r Gaussian processes, [7] for processes with singular 
spectrum, [5| and |2U] for finite-rank processes). 

1.2. Ledrappier's counterexample in 2 dimensions: the 3-dot system. In the opposite 
direction, Ledrappier |12j produced in 1978 a counterexample showing that in the case of stationary 
processes indexed by T? (we should rather speak of stationary random fields in this context), 2-fold 
mixing does not necessarily imply 3-fold mixing. Here is a description of his example: Consider 

G := G {0, 1} 2 " : V(i,i), &j + fo-ij + fcj+i = mod 2} . 

Let us describe a probability law \x on G by the way we pick a random clement in G: First, 
use independent unbiased coin tosses to choose the £^0 on the horizontal axis (one coin toss 
for each i G %: these random variables are independent). Now, note that the "3-dot rule" 
+ d+i.j + Ci-j+i — mod 2 for each completely determines the coordinates £jj on the 
upper-half plane j > 0. It remains to choose the fjj for j < 0. For this, observe that we have yet 
no constraint on £o,-l- We choose it with an unbiased coin toss, and then the entire line is 
completely determined by the 3-dot rule. To complete the whole plane, we just have to pick each 
of the £o,j (j < ~1) with a coin toss, and then inductively fill each horizontal line with the 3-dot 
rule. 
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Figure 1. Generation of a random configuration in G. First, use independent 
coin tosses to choose the values of the shaded cells, then apply the 3-dot rule to 
complete the others: Three adjacent cells disposed as the three dotted ones must 
contain an even number of l's. 

The addition mod 2 on each coordinate turns G into a compact Abelian group. We let the 
reader check that the probability law fi defined above on G is invariant by addition of an arbitrary 
element of G, thus fi is the unique normalized Haar measure on G. Since any shift of coordinates 
in 71? is an automorphism of the group G, such a shift preserves /i. Hence [i turns into a 

stationary random field. 

The definition of fc-fold mixing for a stationary random field is formally the same as in the case 
of processes, except that a window is no longer an interval on the line but a square in the plane: 
{(io + i,jo + j) ■ < i, j < £} for some (io,jo) £ Tl? and some I > 0. Let us sketch a geometric 
argument showing why the 2-fold mixing property holds for Starting with the cells on the 

We must point out that in Rohlin's article, the definition of fc-fold mixing originally involved k + 1 windows, 
thus the classical mixing property was called 1-fold mixing. However it seems that the convention we adopt here is 
used by most authors, and we find it more coherent when translated in the language of multifold self-joinings (see 
section \i. II . 
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horizontal axis and the lower-half vertical axis filled with independent coin tosses , we observe 
that, when filling the other cells using the 3-dot rule, 

• the region R\ := ■ i < 0, < j < —i} only depends on the cells (i, 0), for i < 0; 

• the region i?2 := {(hj) '■ J < 0, < i < —j} only depends on the cells (0, j), for j < 0; 

• the region Rq :— ■ < i, < j} only depends on the cells (i,0), for i > 0. (Sec 
Figure H) 

These three regions are therefore independent. Now, if we take two windows of size t, and if the 
distance between them is large enough ("large enough" depending on £). it is always possible to 
shift the coordinates in such a way that each of the shifted windows entirely lies in one of these 
three regions, and not both in the same region. The two shifted windows are then independent, 
and since fi is preserved by coordinate shift, this means that the two windows we started with are 
also independent. 



Ri \ 

□ \y 


R 3 

□ 




R2^\ 



Figure 2. 2-fold mixing for the 3-dot system: If the distance between them is 
large enough, the two square windows can be shifted in such a way that one lies 
in one of the three colored regions, and the other one in another, independent, 
region. 

It remains to see why Ledrappier's example is not 3-fold mixing. For this, apply the 3-dot rule 
from corner from corner (i + and from corner (i, j + 1), then add the three equalities 

(see Figure |2J. In the sum, the random variables £i+i,j> Ci+i.j+i an d are counted twice, 

thus they vanish since we work modulo 2. We get the following equality, which could be called 
the scale-2 3-dot rule: 

(4) Zij + &+2,j + 6,i+2 = mod 2. 

A straightforward induction then shows that for any n > 0, the scale-2 n 3-dot rule holds: 

(5) &,i +&+2»,i +&,3+2» = mod 2. 

But this shows that the three windows of size 1 {(«, j)}, {(i + 2™, j)} and {(i, j + 2™)} are always 
far from being independent, although the distance between them can be made arbitrarily large. 
Hence the random field £ is not 3-fold mixing. 

2. Attempt to construct a 3-dot- type one-dimensional process 

2.1. Block construction of a 2-fold but not 3-fold mixing process. In this section we 
describe a naive attempt to mimic the 3-dot construction on a one-dimensional process. Our 
process will take its values in the same alphabet A = {0, 1} as for Ledrappier's example, and we 
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Figure 3. Applying the 3-dot rule from three different corners (i + 

and (i, j + 1), and adding the three equalities gives the scale-2 3-dot rule. 



start by randomly picking the two random variables £o and £i with two independent unbiased coin 
tosses : £o and £i are independent, and each one is equal to 1 with probability 1/2. Then, we set 

6 := ft + 6 mod 2. 

Each random variable & is called a 0-block, and the triple (CcbCijft) is called a 1-block. Wc pick 
the second 1-block (£3, £4, £5) m the same way as the first one, but independently. The third 
1-block (^6,^7,^8) is now set to be the pointwise sum of the first two 1-blocks: 



ft 


:= ft" 


hft 


mod 2, 


ft 


:= ft" 




mod 2, 


ft 


:= ft" 


Kb 


mod 2. 



Observe that this third 1-block follows the same distribution as the first two: ft and ft are two 
independent Bernoulli random variables with parameter 1/2, and ft is the sum mod 2 of these 
variables. Note also that the third 1-block is independent of the first one, independent of the 
second one, but of course not independent of the first two together. The 9-tuple (£0, ft) • • • 1 ft) is 
called a 2-block. 

We can repeat this procedure inductively to construct fc-blocks for each k > 0: Suppose that 
for some k we already have constructed the first fc-block, which is the 3 fe -tuple (ft, • ■ ■ , ft>-i)- 
Then, choose the second fc-block (£ 3 fc £2x3^-1) with the same probability distribution, but 
independently of the first one, and set the third fc-block (( 2x3 k, . . . ,£3^+1-1) to be the pointwise 
sum of the first two fc-blocks: 

(6) £ 2x3 * +j := + £ 3fc+J mod 2 (0 < j < 3 k - 1). 
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Figure 4. Block construction of a stochastic process: The shaded coordinates 
are given by independent coin tosses . The non-shaded coordinates are computed 
from the shaded ones by 3-dot-type rules. 

This inductive procedure gives the construction of a stochastic one-dimensional process (£i)i>o- 
(This construction can easily be extended to a process indexed by TL: Set the negative coordinates 
independently of the nonnegative ones by a similar symmetric construction.) Let us sketch the 
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proof that our process is 2-fold mixing. For this, we use the two following facts, whose verification 
is left to the reader: 

• Two different fc-blocks are always independent. 

• Call a fc- overlapping the concatenation of two consecutive (k — l)-blocks lying in two dif- 
ferent fc-blocks. Any fc-overlapping is independent of any concatenation of two consecutive 
(fc — l)-blocks lying in other fc-blocks. 

Now, take two windows of fixed size £, and let k be an integer such that £ < 3 k ~ 1 . Then, if the 
distance between the two windows is greater than 3 fc , either they lie in two different fc-blocks, or 
at least one of them lie in a fc-ovcrlapping. In both cases the two windows are independent. 
However, the stochastic process £ is clearly not 3-fold mixing, since for any fc > 0, we have 

Co + + = mod 2 - 

This, of course, does not make £ a counterexample to Rohlin's question: The process we have just 
constructed is not a stationary one! Indeed, the pattern '111' for example can not be seen in the 
sequence £o£i£2j but it can occur in the sequence £i£2&3 with probability 1/8. 

2.2. How to make the construction stationary. The example described in the preceding 
section can be turned into a stationary process by applying some trick which is presented here. 
The process is still inductively constructed with fc-blocks which follow the same distribution as 
before. The difference consists in the way fc-blocks are extended to (fc + l)-blocks. Observe that 
a fc-block lying in a given (fc + l)-block can have three positions, which will be denoted by '0' 
(the first fc-block in the (fc + l)-block), '1' (the second one) and '2' (the third one). We are 
going to define the increasing family of fc-blocks (fc > 0) containing the coordinate £o by using a 
sequence S = (Sk)k>o of independent, uniformly distributed random variables, taking their values 
in {0,1,2}. 

We start the construction by picking the first 0-block £o in the usual way, with a coin toss. 
Now, we have to decide whether this 0-block is in the first, second or third position in the 1-block. 
This is done by using the first random variable Sq. Next, we complete the 1-block by tossing a 
coin for the first missing variable, and setting the last one to be the sum mod 2 of the two others. 
The extension from the fc-block to the (fc + l)-block containing £o goes on in a similar way: Once 
we have determined the fc-block, we use the random variable Sk to decide whether this fc-block is 
in the first, second or third position in the (fc + l)-block. Then the first missing fc-block is chosen 
independently, and the last one is set to be the pointwise sum of the two other fc-blocks. 
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Figure 5. Beginning of the construction with the skeleton sequence So = 1, 
Si = 2, and S 2 = 0. 

The embedding of fc-blocks in fc + 1-blocks is called the skeleton of the process, and the i.i.d. 
sequence (Sfc)fc>o coding this embedding is the skeleton sequence. Since almost every realization 
of the skeleton sequence contains infinitely many l's, the preceding procedure applied for all fc > 
gives rise to fc-blocks extending arbitrarily far away from on both sides with probability one. 
This defines the whole process £ = (£i)ie^- 

Let us see how the skeleton sequence evolves when a coordinate shift is applied to the process 
£. It is not difficult to convince oneself that a shift of one coordinate to the left corresponds to the 
addition of T' on the 3-adic number defined by the sequence (Sk)- (Write the sequence from right 
to left, and see it as a "number" written in base 3 with infinitely many digits, So being the unitary 
digit; then add '1' to the sequence as you would do it for an ordinary number: Add '1' to So, and 
if So reaches 3, then set So = and add to Si, and so on.) Observe that the distribution of the 
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skeleton sequence is the same after this addition of '1', hence the distribution of the skeleton is 
invariant under the action of the coordinate shift. But once the skeleton is fixed, the distribution 
of the process is entirely determined by giving the distribution of fc-blocks for every k > 0, which 
is the distribution described in the preceding section. Therefore the whole distribution of the 
process is invariant under the coordinate shift, and the process we get now is stationary. 

t = 0, S = •••0 2 1 

t = l, S = ---02 2 









t = 2 
t 


S= 
= 3, 5 = •••10 1 
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1 1 




EES 
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10 1110 11 



Figure 6. Action of the coordinate shift on the skeleton sequence. The arrows 
denote the position of £o at successive times during the iteration of the shift. 

Unfortunately, making the process stationary has a cost: We have lost the 2-fold mixing prop- 
erty! Indeed, if for example we look at the rightmost coordinate So of the skeleton sequence, we 
see that any realization of the process £ gives rise for So to the periodic sequence 

• •• 012012012- •• 

So, the process we get when we only observe So is periodic. But if £ was 2-fold mixing, then every 
factor of £ (that is to say, every stationary process which can be seen as a function of £, such as 
the process generated by So for example 2 ) would also be 2-fold mixing. 

2.3. A relative counterexample to Rohlin's question. The stationary process generated by 
the whole skeleton sequence S is well-known in ergodic theory, and is called the 3-adic odometer. 
(Be careful: S does not take its values in a finite alphabet, it has infinitely many coordinates 
taking their values in {0,1,2}.) This process, which has appeared as a factor of £ in our new 
construction, is far from being mixing, since each of its coordinates is periodic. However, we can 
notice some interesting facts regarding the 2-fold and 3-fold mixing properties of £. Namely, once 
the skeleton is fixed (that is to say, conditionally to the er-algebra generated by S), the mixing 
properties of £ are similar to those of the non-stationary process constructed in Sect ion [2. II Thus, 
the process £ is 2-fold, but not 3-fold mixing relatively to the factor a-algebra generated by S. 
(More details on relative /c-fold mixing can be found in |17|.) 

It is a common idea in abstract ergodic theory to say that the study of stationary processes 
relatively to their factor a-algebras gives rise to similar results as in the absolute study (one of 
the best examples of this fact is Thouvenot's relative version of Ornstein's isomorphism theorem 
|21j : another example is the proof of Proposition ^ . 2l prcsentcd below). Therefore, the process that 
we have just constructed could make us think that a one-dimensional counterexample to Rohlin's 
question should exist. However, we are going to show in the next Section that, if such a process 
exists, it must be of a different nature than this one, or than Ledrappicr's counterexample in H? . 

3. Obstruction to the construction of a 3-dot-type one-dimensional 

counterexample 

3.1. Multifold mixing and self-joinings. We need now to present a powerful tool which has 
been introduced in ergodic theory by Furstenberg 0]: The notion of self-joining of a stationary 
process. Let £ = (£j)iez be a stationary process taking its values in the alphabet A, and denote 



We leave as an exercise for the reader the verification of the fact that the skeleton sequence is indeed a function 

off. 
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by /i its probability distribution on A z . Take £' another process defined on the same probability 
space, taking its values in the same alphabet A, and following the same distribution /i. Then we 
can consider the joint process (£, £'), taking its values in the Cartesian square A x A. If this joint 
process is still stationary, then we say that its distribution A on A z x K % « (A x A) z is a 2-fold 
self-joining of £. In other words, a 2- fold self-joining of £ is a probability distribution on A z x A z 
whose marginals are both equal to /j,, and which is invariant under the coordinate shift. 

Let us see some simple examples of such self-joinings. The first idea is to take the two processes 
£ and £' independent of each other. Then we get fi ® p. as our first example of a 2-fold self-joining. 
Another very simple example is obtained by taking £' = £, and we denote by Ao ("diagonal 
measure") the 2-fold self-joining of £ concentrated on the diagonal of A z x A z . This can be 
generalized by considering the case where £' is equal to a shifted copy of £: We fix some p 6 TL, 
and we set := t;i+ p for each t £ 2. We denote by A p the shifted diagonal measure obtained in 
this way. 

The set J 2 (£) of all 2-fold self-joinings of £ is endowed with the metrizable topology defined by 
the following distance: 

d(Ai,A 2 ) := £>T _L 7 |A 1 (CeC' n ^ / eC n 0-A 2 (£eC n) C'6:C n 0| ) 

n>0 n'>0 

where (C„)„> is the countable collection af all cylinder sets in A z . This topology (which is 
nothing else than the weak topology restricted to the set of 2-fold self-joinings of £) turns Ji{£,) 
into a compact metrizable topological space. The link with the 2-fold mixing property is now 
straightforward: The stationary process £ is 2-fold mixing if and only if the sequence (A p ) of 
shifted diagonal measures converges in J 2 (£) to the product measure \x ® \i as p — > +oo. 

To translate the 3-fold mixing property into the language of joinings, we have to generalize the 
notion of self-joining to the case where 3 processes £, £ and £" with the same distribution p are 
involved. This naturally leads to the definition of a 3-fold self-joining of £. (We can of course 
define an r-fold self-joining of £ for any r > 2, but for our purpose the cases r = 2 and r = 3 
will suffice.) The set Ja(£) is also turned into a compact metrizable space when endowed with the 
restriction of the weak topology. Particularly simple and interesting elements of Ja(£) are again 
the product measure p®p®p, and the shifted diagonal measures A Pjg , p, q £ Z, the latter denoting 
the distribution of the triple (£,£',£") when for all i € Z, 

(7) ^ = Ci+p and £" = 

The process £ is 3-fold mixing if and only if the following convergence holds in Js(£) : 

(8) A pq > fl®H<g>[l. 

Now, let us assume that £ is a 2-fold mixing stationary process which is not 3-fold mixing. 
Then, since A P;9 does not converge to the product measure, we can find a subsequence A Prii q n 
converging to some 3-fold self-joining A^/j®fi0/i. But the 2-fold mixing property of £ tells us 
that, under A, the 3 processes £, £' and £" have to be pairwise independent. Hence, we get the 
following conclusion: 

Proposition 3.1. If £ is a 2-fold mixing stationary process which is not 3-fold mixing, then £ has 
a 3-fold self-joining A ^ p ® p <S> p with pairwise independent coordinates. 

3.2. Restriction to zero-entropy processes. The natural question now is whether stationary 
processes satisfying the conclusion of Proposition 13 . II can exist. But, without extra requirements, 
it is easy to find examples of such pairwise independent self-joinings which are not the product 
measure: Let £ consist of i.i.d. random variables (£ p ) p ez, taking their values in {0, 1}, each one 
with probability 1/2. Take an independent copy (£ p )pez of this process and set 

£p := Hp + Z' P mod 2. 

Then the three processes £, £' and £" have the same distribution, are pairwise independent, but 
the 3-fold self-joining of £ we get in this way is not the product measure. 
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However, this process is not a counterexample to Rohlin's question. Indeed, all its coordinates 
being independent, £ is of course fc-fold mixing for any k > 2. It was Thouvenot who explained 
that this kind of Bernoulli shift situation could be avoided when one studies Rohlin's question, 
because it is always possible to restrict the analysis to the case of zero-entropy stationary processes. 
A stationary process £ = (&)iez taking its values in a finite alphabet has zero entropy if and only 
if £o is measurable with respect to the past, i.e. with respect to the cr-algebra <r(£i, i < 0). (Main 
definitions and results concerning entropy in ergodic theory can be found e.g. in [2103IH]) 

Proposition 3.2. If there exists a stationary process £ which is 2-fold, but not 3-fold mixing, 
then one can find such a counterexample in the class of zero-entropy stationary processes. 

Proof. One of the main ingredient to prove this result is the so-called Pinsker a-algebra (or tail 
field) of the process, which is the cr-algebra 



n(o : = n v^-oo) = n ct ^ +o °)- 



The Pinsker cr-algebra of £ is invariant by the coordinate shift T: (£„) i— > (£„), where £„ := £ n +i- 
Therefore, if we choose any ±T(£)-measurable random variable foj taking its value in a finite 
alphabet B, and if we set for all i G Z 







Co o r 



then the whole stationary process ( = (Ci)«ez is II(£)-measurable. What is remarkable is that 
such a process always has zero entropy, and that any stationary process £ with zero entropy which 
is a factor of £ is automatically n(£)-measurable (see e.g. [JJ], Theorem 18.6). Besides, Krieger's 
finite generator theorem ensures that it is always possible to find such a factor process £, taking its 
values in an alphabet B containing only two letters, and generating the whole Pinsker cr-algebra 
(nice proofs of Krieger's theorem can be found in jS], or ^Hl)- We henceforth fix such a process 
£ with 

n(£) = 4,ie %). 

As a factor of £, £ automatically inherits the 2-fold mixing property. It remains to show that if £ 
is 3-fold mixing, then so is £. 

For this, let us fix three cylinder events E\, E2 and £3, which are measurable with respect to 
£q for some t > 0. We have to compute the limit, as p and q go to +00, of the quantity 
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The martingale convergence theorem gives 
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hence the first term can be bounded by e if p is large enough. We are left with 
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and again the martingale convergence theorem allows us to replace IE l^i^ E2 | <r(£ ( 



t+oo j 



with 
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(9) 



2 |n(0 



if g is large enough. We thus have proven 
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Note that in this equation, the expectation can also be written as 
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Now, if the process £ generating II(£) is 3-fold mixing, we get that this expectation converges, as 
p and q go to +00, to the product 
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= P $ G ^i) P $ G 3a) P (Co G E 3 ) 
which means that £ is also 3-fold mixing. 
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Let us make some comment about this proof. The main argument is a nice illustration of the 
principle that the behaviour of a stationary process conditionally to one of its factors gives rise 
to similar results as in the non-conditioned case. The key notion here is the so-called K -property: 
We say that the stationary process £ has the if-property if its Pinskcr er-algcbra II(£) is trivial. 
It is easy to show that this if-property implies 3-fold mixing: Just write the preceding proof until 
(jSJ in the case where II(£) is trivial, and you get the result. (In fact, the same argument gives 
that the if-property implies fc-fold mixing for any k). Now, when a factor £ of £ is given, we can 
also define the if -property of £ relatively to £ (see e.g. and check that £ always has the 

if-property relatively to its Pinsker er-algcbra: This comes from the fact that any factor of £ with 
zero entropy is n(£)-measurable. But this in turn gives that £ is 3-fold mixing relatively to II(£): 
This is more or less what © says. The end of the proof consists in checking that, if £ is 3-fold 
mixing, and if £ is 3-fold mixing relatively to £, then £ is 3-fold mixing. 

Now that we have reduced Rohlin's problem to the case of zero-entropy processes, we can ask 
the question of pairwise independent self-joinings in this zero-entropy class. 

Question 3.3. Does there exist a zero-entropy, 2-fold mixing stationary process £, and a 3-fold 
self-joining A of £ for which the coordinates are pairwise independent but which is different from 
the product measure? 

Note also that if the assumption of 2- fold mixing is dropped, we can again find some counterex- 
ample: Take a periodic stationary process £ taking its values in {0, 1,2}, where £0 is uniformly 
distributed in the alphabet, and £,+i = £j + 1 mod 3 for all i G TL. Take an independent copy £' 
of £, and set 

£f := 2&-ti mod 3. 

Then £, £' and £" share the same zero-entropy distribution, they are pairwise independent but the 
3-fold self-joining they define is not the product measure. 

3.3. 3-dot-type pairwise independent self-joinings. Let us consider now the two-dimensional 
example of Lcdrappicr from the point of view of self-joinings. The definition of the shifted diagonal 
3- fold self-joining A p „ is formally the same as in (JJJ, but in this case p and q arc both elements 
of H? . Particularly interesting is the sequence A p?i qn , where for all n > 0, 

p n := (0,2™) and q n := (2",0). 

From the relation we see that the sequence A Pni9n converges as n — > +00 to the 3-fold self- 
joining A of £ under which the three coordinates £, £' and £" are pairwise independent (because £ 
is 2-fold mixing), but which is not the product measure since, for all i E H? , 

(10) er = mod 2. 
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Observe also that, for the non-stationary process constructed in Section 12. II the sequence of 3- 
fold self-joinings A.3n t 2x3 n a lso converges to a joining with pairwise independent marginals which, 
by ©, satisfies a relation similar to l|10(l . Since this kind of self-joining seems to appear naturally 
when one tries to construct examples which are 2-fold but not 3-fold mixing, we introduce the 
(slightly more general) following definition. 

Definition 3.4. Let £ be a stationary process taking its values in a finite alphabet A. We call 
3-dot-type self-joining a 3-fold self-joining X of £ which has pairwise independent marginals, and 
for which there exists a map f : A x A — > A such that 

(11) V*eZ, er = (A-a.s.). 

The result that we shall prove now shows that there is no hope to find a 2-fold but not 3-fold 
zero-entropy stationary process which admits a 3-dot-type self-joining: The only 3-dot-type self- 
joinings which can be seen in zero entropy are those arising from periodic (therefore non 2-fold 
mixing) processes, like in the example presented at the end of Section 13.21 

Theorem 3.5. If the stationary process £ admits a 3-dot-type self-joining, then 

• either £ is a periodic process, 

• or £ has entropy at least log 2. 

The proof of the theorem is based on the following lemma. 

Lemma 3.6. Let X , Y and Z be 3 random variables, sharing the same distribution on the finite 
alphabet A. Assume that these random variables are pairwise independent, and that there exists 
a map f : A x A — v A such that 

(12) Z = f(X,Y) (a.s.). 

Then their common distribution is the uniform distribution on a subset of A 

Proof. Taking a subset of A instead of A if necessary, we can assume that each letter of A is seen 
with positive probability. Let us fix some y € A, and condition with respect to the event (Y = y). 
Since X, Y and Z are pairwise independent, we get for any x, z G A 

P(X = x\Y = y) = ¥(X = x) and P(Z = z\Y = y) = ¥(Z = z). 

But knowing (Y = y), we have the equivalence 

(Z = z) ^ (f(X,y) = z). 

Since X and Z can take the same number of values, we deduce that for any z £ A, there exists a 
unique x such that 

f(x,y) = z, 

and that furthemore, this x satisfies 

(13) T{X = x) = V(Z = z). 

Finally, observe that we can condition on (Y = y) for any y £ A, and therefore that l|13|) holds 
for any x and z in A for which we can find a y £ A satisfying fix, y) = z. But since X and Z are 
independent, this is true for any x, z € A. □ 

Proof of Theorem VJ. 51 Fix some m > 1, and apply Lemma f3 . 61 with X := ^q 1-1 , Y := £' m_1 and 
Z := £" " 1 1 under the 3-dot-type self-joining of £: We get that is uniformly distributed 

on the subset of sequences in A m which are seen with positive probability. Denote by p m the 
number of such sequences. The same argument applied with to + 1 in place of m gives that 
each of the p m +i possible sequences of length m + 1 is seen with probability l/p m +i. Therefore, 
any possible sequence of length to has exactly a m :— p m +i/Pm different ways to extend to some 
possible sequence of length m + 1, and each of the possible extensions has a conditional probability 
l/a m . Now, observe that since £ is stationary, the number of possible extensions of a sequence of 
length to + 1 can not be greater than the number of possible extensions of the sequence of length 
to obtained by removing the first letter, which gives the inequality a m+ \ < a m . Therefore, there 
are two cases: 
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• Either a m = 1 for m large enough, and in this case the process £ is periodic; 

• Or a m is always greater than or equal to 2, and then the entropy of £ is at least log 2. 

□ 

3.4. Difference between dimension one and two. It seems interesting to analyze the result 
proved in the preceding section, to see precisely which obstacle prevents us from constructing a 
3-dot-typc counterexample in dimension one, although it is allowed in dimension two. At which 
point docs the one-dimensional argument dcvcloppcd above stop applying to the case of dimension 
two? 

First, note that the reduction of Rohlin's question to zero-entropy processes is still valid in the 
case of two-dimensional random fields: We can still consider in this case the Pinsker tr-algebra 
n(£), with respect to which every zero-entropy factor of £ is measurable, and prove that if £ is 
2-fold but not 3- fold mixing, then this property comes from II(£). (For some presentations of 
the Pinsker cr-algcbra in the multidimensional case, we refer the reader to [Tl [SI 11UI lllj . For the 
generalization of Kriegcr's finite generator theorem to Z d -actions, see e.g. [151 15*]. ) As far as 
Lcdrappier's 3-dot example is concerned, there is no need to take a factor since this stationary 
random field already has zero entropy. 

Next, Lemma [3.61 gives that if £ is a finite- valued stationary random field indexed by TL 2 ad- 
mitting a 3-dot-type self joining, then the distribution of £ on any finite window is uniform on 
all the configurations which are seen with positive probability. And indeed, we can easily check 
that this is the case for Ledrappier's construction (for example, any allowed configuration in a 
rectangular l\ x £ 2 window has probability 2~^ 1+ ^ 2-1 '). And this seems to be the point of the ar- 
gument where there is a difference between TL and TL 2 : Although this uniform-probability property 
implies periodicity for one-dimensional zero-entropy stationary processes, this is no more true for 
two-dimensional stationary random fields, as we can see with Ledrappier's example. 

We can also observe that in the one-dimensional case, if the stationarity property is dropped the 
argument fails at the same point: The relative counterexample constructed in Section 12.21 that is 
to say the process conditioned on the skeleton sequence, also has the uniform-probability property 
(which means in the non-stationary setting that for any i < j, all the possible configurations for 
the sequence £j have the same probability). However, this process has zero entropy: There exists 
almost surely some k such that Sk = 2, therefore £o lies in the third fc-block in its (k + l)-block. 
Then £o can be computed from the first two fc-blocks, which are measurable with respect to the 
past of the process. 

4. Further questions 

Many important results have already been presented around Rohlin's multifold mixing problem. 
Most of them consider some special category of stationary processes {e.g. finite-rank processes, or 
processes with singular spectrum), and prove that in this category a pairwise- independent 3-fold 
self-joining has to be the product measure. 

We hope that the work presented here can be the beginning of a slightly different approach: 
Consider a stationary process which admits some kind of pairwise-independent 3-fold self-joining 
which is not the product measure, and see which other properties on the process this assumption 
entails. In this direction, the most natural generalization of the present study should be the 
following situation: Take a stationary process £ which admits a pairwise-independent 3-fold self- 
joining A, and assume that the three coordinates £, £' and £" of this self-joining satisfy 

(14) e = ¥>(£,0 A-a.s. 

for some measurable function <p : A z x A z — > A z . Then what can be said on the process £? Can 
it have zero entropy and be 2-fold mixing? 

Since this general question seems to be quite difficult, some more restricted classes of pairwise 
independent self-joinings could be considered first, satisfying l|14|l with some regularity assumption 
on ip. For example, what happens if under A, the coordinate £q is a function of finitely many 
coordinates of £ and £'? 
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